A Stratification and Sampling Model for Bellwether Moving Window
نویسندگان
چکیده
An effective method for finding the relevant number (window size) and the elapsed time (window age) of recently completed projects has proven elusive in software effort estimation. Although these two parameters significantly affect the prediction accuracy, there is no effective method to stratify and sample chronological projects to improve prediction performance of software effort estimation models. Exemplary projects (Bellwether) representing the training set have been empirically validated to improve the prediction accuracy in the domain of software defect prediction. However, the concept of Bellwether and its effect have not been empirically proven in software effort estimation as a method of selecting exemplary/relevant projects with defined window size and age. In view of this, we introduce a novel method for selecting relevant and recently completed projects referred to as Bellwether moving window for improving the software effort prediction accuracy. We first sort and cluster a pool of N projects and apply statistical stratification based on Markov chain modeling to select the Bellwether moving window. We evaluate the proposed approach using the baseline Automatically Transformed Linear Model on the ISBSG dataset. Results show that (1) Bellwether effect exist in software effort estimation dataset, (2) the Bellwether moving window with a window size of 82 to 84 projects and window age of 1.5 to 2 years resulted in an improved prediction accuracy than the traditional
منابع مشابه
Prime Incision and Modified Moving Window: A Minimally Invasive Access for Breast Cancer Surgical Treatment
BACKGROUND Conservative surgical treatment has been the treatment of choice for early breast cancer. It allows feasible oncological treatment with a satisfactory cosmetic approach and fast recovery. However, in some cases mastectomy is necessary. This study proposes a surgical approach with only one surgical access through the same incision, which is in line with precepts mentioned abov...
متن کاملOperating Range Expansion in a HCCI Natural Gas Engine Using Charge and Thermal Stratification in Combustion Chamber
HCCI operating window has two distinct boundaries of knock at higher load region and misfiring/partial burning at lower load region. Moreover, there is no conventional direct way of controlling combustion timing in an HCCI engine. In this research, experimental study were carried out to investigate the effect of thermal and charge stratification on expansion of the operating range of a natural ...
متن کاملPrediction of Epileptic Seizures in Patients with Temporal Lobe Epilepsy (TLE) based on Cepstrum analysis and AR model of EEG signal
Epilepsy is a chronic disorder of brain function caused by abnormal and excessive electrical neurons discharge in the brain. Seizures cause disturbances in consciousness that occur without prior notice, so their prediction ability, based on EEG data, can reduce stress and improve quality of life. An epileptic patient EEG data consists of five parts: Ictal, Inter-Ictal, pre-Ictal, Post-Ictal, an...
متن کاملDesigning Talent Management System Template via Mixed Method in Gas Company of Sistan & Baluchestan Province
The purpose of the present study was to design a talent management system model in Gas Company of Sistan and Baluchestan province. The mixed method (qualitative-quantitative) was selected as the research method. The statistical population in the qualitative section consisted of managers and supervisors and the research data was gathered through in-depth semi-structured interviews with 11 indivi...
متن کامل“Equivalent Linear Composition” as an Efficient Stratification Factor in Multipurpose Surveys
Horticulture survey is a multi-purpose survey which is conducted ad hoc by Statistical Center of Iran (SCI). Availability of survey variables in the sampling frame suggests a multivariate stratification in each province based on its desired variables for acquiring a higher efficiency. There are several ways to stratify the sampling frame considering all stratification variables, such as using s...
متن کامل